Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions

نویسندگان

  • Foster J. Provost
  • Tom Fawcett
چکیده

Applications of inductive learning algorithms to realworld data mining problems have shown repeatedly that using accuracy to compare classifiers is not adequate because the underlying assumptions rarely hold. We present a method for the comparison of classifier performance that is robust to imprecise class distributions and misclassification costs. The ROC convex hull method combines techniques from ROC analysis, decision analysis and computational geometry, and --I--,LT.-.L11~. ..-.,I--.,-f --~-I---! aaapss r;nem 50 cne parr;iculars 01 analyzing iearned classifiers. The method is efficient and incremental, minimizes the management of classifier performance data, and allows for clear visual comparisons and sensitivity analyses.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis and Visualization of Classi er Performance : Comparison under Imprecise Class and Cost

Applications of inductive learning algorithms to real-world data mining problems have shown repeatedly that using accuracy to compare classiiers is not adequate because the underlying assumptions rarely hold. We present a method for the comparison of classiier performance that is robust to imprecise class distributions and misclassiication costs. The ROC convex hull method combines techniques f...

متن کامل

Analysis and Visualization of Classifier Performance with Nonuniform Class and Cost Distributions

Applications of machine learning have shown repeatedly that the standard assumptions of uniform class distribution and uniform misclassification costs rarely hold. Little is known about how to select classifiers when error costs and class distributions are not known precisely at training time, or when they can change. We present a method for analyzing and visualizing the performance of classifi...

متن کامل

ROC Analysis of a Linguistic Decision Tree Merging Algorithm

ROC analysis is an important tool for evaluation and comparison of classifiers in imprecise environments (i.e., class distribution and cost parameters are unknown). Area Under the Curve of ROC (AUC) is increasingly being recognized as a better measure for evaluating algorithm performance than accuracy. A bigger AUC value implies a better ranking performance for a classifier. Linguistic decision...

متن کامل

Cost-Sensitive Classifier Evaluation Using Cost Curves

The evaluation of classifier performance in a cost-sensitive setting is straightforward if the operating conditions (misclassification costs and class distributions) are fixed and known. When this is not the case, evaluation requires a method of visualizing classifier performance across the full range of possible operating conditions. This talk outlines the most important requirements for cost-...

متن کامل

Fault diagnosis in a distillation column using a support vector machine based classifier

Fault diagnosis has always been an essential aspect of control system design. This is necessary due to the growing demand for increased performance and safety of industrial systems is discussed. Support vector machine classifier is a new technique based on statistical learning theory and is designed to reduce structural bias. Support vector machine classification in many applications in v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997